DRAFT: Scalability Bugs: When 100-Node Testing is Not Enough
نویسندگان
چکیده
We highlight the problem of scalability bugs, a new class of bugs that appear in “cloud-scale” distributed systems. Scalability bugs are latent bugs that are clusterscale dependent, whose symptoms typically surface in large-scale deployments, but not in small or mediumscale deployments. The standard practice to test large distributed systems is to deploy them on a large number of machines (“real-scale testing”), which is difficult and expensive. New methods are needed to reduce developers’ burdens in finding, reproducing, and debugging scalability bugs. We propose “scale check,” an approach that helps developers find and replay scalability bugs at real scales, but do so only on one machine and still achieve a high accuracy (i.e., similar observed behaviors as if the nodes are deployed in real-scale testing).
منابع مشابه
Techniques for Identifying Elusive Corner-Case Bugs in Systems Software
Modern software is plagued by elusive corner-case bugs (e.g., security vulnerabilities). There are no scalable, automated ways of finding them, therefore such bugs can remain hidden until software is deployed in production. This thesis proposes approaches to solve this problem. First, we present black-box and white-box fault injection mechanisms, which allow developers to test the behavior of t...
متن کاملEnhancing Automated Fault Discovery and Analysis
ENHANCING AUTOMATED FAULT DISCOVERY AND ANALYSIS By Jared David DeMott Creating quality software is difficult. Likewise, offensive researchers look to penetrate quality software. Both parties benefit from a scalable bug hunting framework. Once bugs are found, an equally expensive task is debugging. To debug faults, analysts must identify statements involved in the failures and select suspicious...
متن کاملABHRANTA: Locating Bugs that Manifest at Large System Scales
A key challenge in developing large scale applications (both in system size and in input size) is finding bugs that are latent at the small scales of testing, only manifesting when a program is deployed at large scales. Traditional statistical techniques fail because no error-free run is available at deployment scales for training purposes. Prior work used scaling models to detect anomalous beh...
متن کاملTAC: A Topology-Aware Chord-based Peer-to-Peer Network
Among structured Peer-to-Peer systems, Chord has a general popularity due to its salient features like simplicity, high scalability, small path length with respect to network size, and flexibility on node join and departure. However, Chord doesn’t take into account the topology of underlying physical network when a new node is being added to the system, thus resulting in high routing late...
متن کاملUnderstanding , Detecting , and Repairing Performance Bugs
Software performance is critical for how end-users perceive the quality of software products. Performance bugs—programming errors that cause performance degradation—lead to poor user experience and low system throughput. Despite advances in profiling techniques, performance bugs still escape in production runs. There are two key reasons why performance bugs are not effectively detected during i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017